L-ITR 



Kan 



pBR322 ori 



R-ITR 




pWWSGl HPRT 

13612 bp ^^^^^^ 



Left Inverted Terminal Repeat: 1 - 1 03 

Encapsidation Signal Q¥): 183-331 

HPRT Introns: 365-10557 

Right Inverted Terminal Repeat: 
10571-10673 

pBR322ori: 10877-11544 

Kanamycin Resistance Gene: 12353- 
13144 



FIG. 1 



SEQ ID NO:l pSHuttle Sequence 



CATCATCAATAATATACCTTATTTTGGATTGAAGCCAATATGATAATGAGGGGGTGGAGTTTGTGACG 

TGGCGCGGGGCGTGGGAACGGGGCGGGTGACGTAGTAGTGTGGCGGAAGTGTGATGTTGCAAGTGTG 

GCGGAACACATGTAAGCGACGGATGTGGCAAAAGTGACGTTTTTGGTGTGCGCCGGTGTACACAGGA 

AGTGACAATTTTCGCGCGGTTTTAGGCGGATGTTGTAGTAAATTTGGGCGTAACCGAGTAAG 

CCATTTTCGCGGGAAAACTGAATAAGAGGAAGTGAAATCTGAATAATTTTGTGTT 

AATACTGGTACCGCGGCCGCCTCGAGTCTAGAACTAGTGGATCCCCCGGGCTGCAGGAATTCTGATGG 
CTCTCAAAATTCCTGCCTCCTTTAGGGATAAAAGACTTTAAGACTT^^ 

AAAAAAATTCCTGCCTCCTGGTGTACACACACAGAAGGGTTCCCTCCCCTTGAATGTGACCAGGATCT 

GTGAAAATAACGGGATAGCCGCTCCTGTGATTAGGTTATGTGGTAGACTAGAGCAAGATTCTCCTGCT 

GGTTTTGAAGAAGTCAGCTGCCATGTTGTGAGACTGTCATGGGCTAGGGCATGAGCCTTTAAATATCT 

GGGAGCAACCCCTGGCCAGCAGCCAGTGAGAAAACGGGCCCTCAGTCCTACAATCACAAGGAACTAA 

ATTCTGCCAACAACCTGAAGGAACTTTGAAGAGGATCATGAGTCCCTTGATTCAGCTTGATGAGCCCC 

TGAGCAGAGGATACAGCTAACTTGTACTAGGGAAGTATAAAAAACATGCATGGGAATGATATATATC 

AACTTTAAGGATAATTGTCATACTTCTGGGAATGAAGGGAAAGAAATGGGGCTTTAGTTGTATTATGA 

TCTTTAATTTCTCAAAAAAAATAAGATCAGAAGCAAATATGGCAAAATGT^ 

GTAGGTATTCAGCATACCCTTTTTTCTGAGTTCAAAATATTTTATAATTAA 

CACAGTGGCTCATGCCTATAATACCAGCACTTTGCGAGGCCGAGGTGGGAGGATGGCTTGAGGCCAGA 

CCAGCCTGGCCAACATGGCAAAACCCCATCTCTACTTAAAAAAAAAAAAACTATATATATATATATGT 

GTGTGTGTGTGTATATATATATATGTATATATATTTATATATGTGTGTATATATATATATGTATATATAT 

TTATATATGTGTGTGTATATATATATATACACACACACACATATATACATACATACATACACACACACA 

CACACACAATTAGCCAGGCATGGTGGCGCACACCTGTAGTCCCAGCTACTTGGGAGGCTGAGACATGA 

GAATTGCTTGAACCTGGGAGGCAGAGTAGTTAGTGAGCTGAGATCATACCACTGCACTCCAGCCTGGT 

GACAGAGTGAGACTCTGTCTTAAAAAjVAATAAAAATTAAAATTAAATGCA^^^ 

gaagaggaaaggggtatcaaggaaggttttgtggaggtgacgtttgagctgggtcttaaatgact^ 

aacatgggataagaagggagggaataaggacatttcaggtacgagaaataaggagcatcagtggaa 

acaacctaacgtctgtcaaccagtgaatggataacaaaaatgtaattcagatggtatccaagttacga 

TGGTTCCAACATGAGATTTTTCTGACTTTAGGATAGATTTATCAAAGTAGTA^ 

TGATATTTTCAACTTCAGATGGGTTTATCAGGACACAGTTGAGGAACACCTGTCTATCCATACAAT^ 

GCAATAAAAAGGAAATGAGTGCAGATATACTCCACAACATGAATGAACCTTGAAAACATTAAGTGAG 

AGAAGCCAGATACAAAAGGCCACATATTGTATGATTCTATTTATACAAAATGTCCAGAATAGGCAAAT 

CTTATAGACAGCAAGTAGGTAGATGATCAGTTTGCTAGGTGCTGGGGGAAGGGGAAATGGGGAGTGA 

TGGCTAAGGGGATTGGGTTTCTTTGTGGGGAAATGAAAATGTTTTAAAATTGAGCGTGAT/^ 

ctcaatgctgcatatatatataatctatagattatatatatataaagagaggctgttagacagtgata 

agtgatatatatatatatatacatagagagagagagagagagagagagaggctgttagtgataagtg 

atcaggaaaataaaagtattgaggaggaatacgaagttgacggtgtgaaaacatgagattt^ 
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GGATGGCCAGGGAAGGCCTTAATGAGAAAGTGACTTATGAGTAAAAACAAGGGATCCTAAACC^ 

CATGCATCAGAATCACTCGGAAACTTGTTAAAGCATAGCTTGCTGGGCCTCATCACAGATATTTTGATT 

CGGTAGGTTCTTGTCTGATATTAATACTTTTGGTCTAGGGAACCACATTTTGAGAACCA 

GGAAGTAAAGGTTTCCCTTAGTTTACTAGCTGGTAACCCTAGGAAACTGCTTAGCCTCTCGGTGCTA^ 

ATACAAAATACTTTAGCACATAATAACACATGGAAj^ATAGTCTATAAATTATAAATAT^ 

TACCAAATATTACATAAGACAAAATCTAAGCAAGATATATATATATATACATAAAATATAAGATATAT 

ATGTATATATTATATATAGATAAATAGAGAGAGAGAGTTATGTTTAGAAAGAAAATACTTCAAACTAA 

AAAAAGAGAGGTAGGAAGTATACCATTCCATTATTGGTAAAAACAAATTACTAAGTAGTCT^ 

AAACCAATCrCACTCCTTTAGAACACAAGCCCACCATTAAy^ 

GCTTACCTTTAGGATGGTGCATACTAAGTTAGAAAAGTCATAAATGTTATATTA^ 

CTTACTTCCACAATCAAGACATTCTAGAAGAAAAAGAGAAATGAAAATCAGTACAATGAATAJ^^ 

GTATTTCCAATTATAAGTCAAATCACATCATAACAACCCTAAGGAATTATCCAAACTCTTGT^^ 

TGCTTTATTATATCAAACTCTCCTTTAAACAAGTGGCCCATCTGCTGGGATTTGGAAGCCTGTAATA 

GAAATTTTCATCATAATGGAAATTTTAAAAACAGAATTTGACCCACCTGTTT^ 

CTTAACAJVGAGGTCTAATCTTGGGCAAGTCTTGAAATTTCTCTGGCCTTAGTTTCCCATGTGTTAAATG 

AAACTTGAAGCAGTTGGTCTCITATAGTCTCCTGACTCT 

CTCAAAAATCACATAATTTAATTTACCATATGGACrCCAAAATATATT^ 

TCTGCATTTTCTGGATGTGTCCATATTCTTGGACTACACTAAAACATGATACCAATGCT^ 

ATAAACCCTCACTTCGCTTTCTACATTTAAGAATTTTATAGCTGGAAGAGTCCTTAAC 

TCTAATAATTACCCCTCAAAATCGAGAAAGTCCTATCTGTTCITATGCTAGTTATAAG^ 

ATTTCACATAATGGTTATAAACACTGCCACAAGAAGATTCATGATGTGTTGTTTATCTG^^ 

CATACTCTGTCATATAACTATAGCATTAAGATTTTAATGTTCTATATATTCTTCTAAGACAGTGTTTAC 

AGAGTAAGGCACAAAAGATCCACTGGTTTGCAAGAAAGATTAGAACTTTTAAATTTTTTAACCTCACC 

TTGTTTAATCTATATTTTTGTATGTATTTTGTAACATATATATTATTATTACCATAA 

AAAATGCATATATTAGGGGTAAATGCTCAGGAAACTTTTTATAAATTGGGCATGCAAATACA^ 

AAGACTCACTGTTCTAGGTATTAAAAGTAAAGTTATAACCAAGTAAAGCTTCCACCTTTTCATGTCTCA 

AAGCAGTTTATTGTTGGAGGTAAGATCTCTTAGAAGCCTAAACAGGTCCAAGTACAGAATGAAGTAAG 

GCTAGCCCATAACTTGTGGCAAGCAATTCATACTATTTCTCTCATGCTGAGCTCTCCTCAGTGAAGCAG 

CTACrATAGACAACTGCAGCCTATTGGTAGCCTATTTTACAGGCAGGAAAAAj^ 

AAGTGGAACTCAGGACATGGGGAGAAAATGAATACAAAAAATAGGGTCAATCCAAAGGCACACAGC 
AAATGAGTAACACAGTTATGTTTTTTTCCCATTTGTATGAGGTCCCAGTA>^ 

ATTTAATAATACACTAAAAAAGCCATGCAATTGTTCAAATGAATCCCAGCATGGTACAAGGAG^^ 

ACACTAGAGTCTAAAAAACAAAAGAATGCCATTATTGAGTTTTTGAATTATATCAAG^^ 

CTACTTAATAAATGAGAAAAACGAGGATAAGAGGCCATTTGATAAAATGAAAATAGCCAAGAAGTG 

TATTAGAGACTTGAATACAGGTATTCGGGTCCAAAGTTCATCTGCTCAAATACTAACTGGGGAAAAGA 

GGGAAAAATATTTATATACATATATATCTGCACACAAAAATACCCCCAAAAGACAAAATGAGGCCAG 

GCAGGGTGGCTCACACCCGTAATCCCGGTACTTTGGGAGGCTGAGGCAGGTGGATACCTGAGATCAGG 
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AGTTGGAGATCAGCCTGGTCAACATGGTGAAACCCTGTCTCTACTAAAGATAAAAAAATTAGCCAGGC 

ATGGTGGCGTGCGCCTGTAATCCCAGCTACTTGGGAGTCTGAGGCAGGAGAATCACTTGAACTGGGAA 

GGGGAGGTTGCAGTGAGCCAAGATCGTACTACTGCACTCCAGCCTGGGCAGCAGAGTGAGACTCCATC 

ACAAAAATAAATAAATAAATAAAATACAATGAAACAGAAAGTTCAAATAATCCCATAATCTT 

CAAGAAATAACTTTCACTCGTTATACTTATTGATTTTTCCATAATAAATGTACT^ 

GAAAAGAAAGTTATTTTAGAAACAGAGAACTGTTTCAGATCAAATCTATGTAGT^ 

AGGTGGGAAAGACGAGATCAAACTAAATCTCAGAAGGCCTAAAAGGCTAGGTCCATTCCAGCACTAA 

AAACTGACCAGACAAGTAATGGCTTCAACAGCTTCTAAATATGGACAAAGCATGCTGAAAGGGAAGG 

ACAGGTCTAACAGTGGTATATGAAATGAACAGGAGGGGCAAAGCTCATTTCTCCTCTGAAGTT^ 

AAGATGCTGAGGAGGACATTAGTTTGACATGACCCTGATATGGGACAAGATAATTTCACAGAAGTT^ 

ACATGTTAAAGTTTTCTTATAGATACTCATTCAAGTAAGCAATGAACACTAAAATCTAAAGAAAGAAA 

AGAGCTTTAGAGTCAGGTCTGTATTCAAATTCAAGCTCTACCACTTACTGGTTCTGTGACTTTGGGCAA 

GTCTTTTAACCTTATTAAGTCTTAATTTCCTGATTTGTAAAATGGGGATAT 

TTGTGAAACTTTTATGAGATTAATGCCTTTATATTTGGCATAGTGTAAGTAAACAATAACT 

CAAAAAAAAAAAGCAGTAGCATTCCATCATTTATTATTGGTTACTCTCAAAAAGTT^^ 

GAAGATAAATATTCAAATACCTTAATATCrCCATTATTTTCAGGTAA^ 

ATGGGTCAACAAATAAATTAAAAGGGAAATCTAAAAACATCITGATAT^ 

ATATACCAAAACCAATGGTTCACACTAGGAGAATTTTAAGGTACAAGAAAACTCTTTGAGATTTC^ 

AAATAATAGTATGTCTGAATTTATTGAGTGATTTACCAGAAACTGTTGTAAGAGCT 

AGCACTTAATCCTCTTAACTCTATGGCTGCTATTATCAACCTCACCCTAATCACATATGGGACACAGAG 

AGGTTAAGTAACTTGCCCAAGGTCAGAGTTAGGAAGTACTAAGCCATGCTTTGAATCAGTTGTCAGGC 

TCCGGAACTCACACTTTCAGCCACTACATAATACTGCTTTGCTATCTTTTAGGAAACTATGTGAGTCTA 

CCTCACATAGACTCACATAGGTTTGTTTTTTTTTTTT^^ 

TTGAAGGATCCCAAATTAGAGTCCCACAGAGGCAGACAGCAGTACTTGACAATATGGACATTTAAGGT 
TAATGTTGGATTCTACTGTCTTTTTACTACATGACCTAGGGAACGATAATTAACCTAG 
GGGTTAAATAACCCATTTAGTTATACTATGTAAATTATCTCTTAGTGATTGATTGAAAGC^ 
CrAATTGACTCGGTATGAAGTGCTTTTTTTTCr^ 

GAGATCATCTCCACCAATTACTTTTATGTCCCCTGTTGACrGGTCATTCTAGTT^^^ 

ATATATATATATATCTACACACACATATGTATATGTATATCCTTATGTACACACACAAACTTCAAATTA 

AATGAGAACTAGAAGATTTGAGAAGTTAGCTAGCTAATATCCATAGCATTATGATATTCTAAATGATA 

TGAATTATAAGAATTAGGTTTCCTGAAATGAATGACTAGAAAACTTTCAAGTAGAGATT^ 

TAAAAAGTCCTAATCGGCCATTACTGATTTGATGTTTTTAAGAGTCCTAAAAAATGGGTTACATCCAT^ 

TTTAAGTGGGTAGTATTATAACAGCCACCCATCTTCAATCACAGTGATTTCTGAATTGTGAGGGAAG 

ATTAGCATGACAGGTGTCTGGTTCTGGCCCTGTACGATTCCCATGAGTCAAGCAAATTGTAAGGGCTG 

GTCTATATCACACCCAACCCCAAGGATATGTCCCTCAAAAGTCTAGCCCAGGCCCCGTCATCTTCAGC 

ATCATCrGGGAAACCAGGTCTGATTAGTAGTCCTTTAAGGAATACCTCTTAGGCTCCCATT^ 

ATCACAGAATCCAATAAAACCCTTACAGGAGATTCAATGGGAAATGCTCAACACCCACTGTAGTTGGT 



FIG. 2B 



GGTGACAATGACCATAATTTGGCTGTGCTGGATTCAGGACAGAAAATTTGGGTGAAAGAGCAGGTGA 

ACAAAAGAGCTTCGACTTGCCCTAGCAGAGAGCAAGCCATACCATACCACAAAGCCACAGCAATTAC 

AACGGTGCAGTACCAGCACAGTAAATGAACAAAGTAGAGCCCAGAAACAGACCCAGAACTATATGAG 

GATTTAGTATACAATAAAGATGGTATTTCGAGTCAGTAGGGAAAAGATGAATTATTCAATAAAT 

TTTGGCCAACTAGTAACCCATTTGGGAAAAAATAAAAGTATGGTCCCTACCTCACAGCATACACAAAA 

ATAAATTCCAGACGGATTAAAATCTAAATGTAAAAAATAAAGCCATAAGTGGACTGGAAGA^^ 

AGAATTTTTTTTAACATCCGTAGAAAGGGTAAAAACCCAGGCATGAC^ 

TTCTGTAACAAATACCCCCTTTTATATATTGGGCTCCAACAATAAGAACCCAT^ 

GAACACAAATAGACAATTTATAGAAGAGAAGGTTATAAGGTGTAAAATTATATCTATCTGAG 

AACACTAAAACAATGTGAITCTACTGTTCTCCCACCCATACTGGCAAAACTTAAGCCTGA^ 

GAGGGGAAATAAGCACTCTTGTTGGTGAGAGTATTAATTGGCATAGCTTCIT^ 

AATACCTGTTAAAATTGCAAACATGCATGTCACTTAATTCCATGTAATTCCCACTTCTGGGAATCA^ 

GCTACAAAAACACTTGACAAGTATACAAAGATACATTCAAGAGTGTTCACTGGGCCGGGTGCGGTGGC 

TTCATGCCTGTAATCCCAGGGAGGCAGAGGCAAGACGATCGCTTGACCCCAGGAGTTCAAGGCCAGCC 

CGAGAAACACAGCAAGACCCTGTCTCTCTTTTTTTTATT^ 

TCACAAAAACAAACCAACATGTCCATTAACAGGGAACCATTTAAATTAATCAAGTTCATCT^ 

GTAATACCATGCAACTATTAAAAAGCACCTGATAATCCAAAGCACACTGAGACAGAATAATGCTAT^ 

AAAACACCAAGTAGTGGAACACTGTGTTGCCTATGACACCATTTTTATTCAACATT^ 

AACAGCAATTACATGAGTAGTGACAATGGCGTTTATGAGACmTTCACTTTTATGTG 

TATGCTTCTATATATACATCCATTTATTATGGAGTGTTACTTTCAAAAATCACAAATGGGCCAGTA 

TTTGGTGTTGCAAGGTGAGCATATGACTTCTGATATCAACCTTTGCATATTACTTCTCAATTTAGGGAA 

ATTACAGACATCCCTTATTCTAACTAACTTAAAACCCAGCATTTCAAACATACAGAATTGATGGGGA 

AAAAAAGAAAGAAGAAAGAAAGAAAAGGCAACAAGCTTCAGATGACAGTGACTCACATCAAATTATT 

TATAAAATCTGTTAAATAGTGCCATCTTCTGGAGATACCTGGTATTACAGTCCAACTCCAGTTGATGTC 

TTTACAGAGACAAGAGGAATAAAGGAAAAAATATTCAAGAACTGAAAAGTATGGAGTCAT^ 

TTGCTGTGATCCAAAGGCTACGGTGATAGGACAAGAAACAAGAGAACTCCAAGCAGTAAGACACTGC 

TGTTCTATTAGCATCCAAACCTCCATACCTCCTGTTTGCCCCAAGGCTTTT^ 

ATCTCACTATTTTGCTCAGGCTGGTCTTGAACTCCTGGACTCAAGCTATCCTCCTGCCTCGGCCTCCT 
AGTGCCGAGATTACAGGCTTGAGTCACCATACCTGGCTATTTATTTTTTCI^ 

ATAGCCACCATGGAAGCTAATAAAGAATATTAATTTAAGAGTAATGGTATAGTTCACTACATTGG^ 

ACAGGTATAAGTGCCTACATTGTACATGAATGGCATACATGGATCAATTACCCCACCTGGGTGGCCAA 

AGGAACTGCGCGAACCTCCCTCCTTGGCTGTCTGGAACAAGCTTCCCACTAGATCCCTTTACTGAGTGC 

CTCCCTCATCTTTAATTATGGTTAAGTCTAGGATAACAGGACTGGCAAAGGTGAGGGGAAAGCTTCCT 

CCAGAGTTGCTCTACCCTCTCCTCTACCGTCCTATCTCCTCACTCCTCTCAGCCAAGGAGTCCAATCTGT 

CCTGAACTCAGAGCGTCACTGTCAACTACATCAAAATTGCCAGAGAAGCTCTTTGGGACTACAAACAC 

ATACCCTTAATGTCnTTATTTCTATTTTGTCTACCTCTTCAGTCT^ 

AGGGAAGAACTTTGTTTATGCCTACnTATCCGCCCCTAGGAATTT^ 
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GAACTGCAGCATGGTATAGAAAAAGAGGAGGAAAGCTGTATAGAAATGCATAATAAATGGGCAGGA 

AAAGAACTGCTTGGAACAAACAGGGAGGTTGAACTATAAGGAGAGAAAGCAGAGAGGCTAATCAAC 

AAGGCTGGGTTCCCAAGAGGGCATGATGAGACTATTACTAAGGTAGGAATTACTAAGGGCTCCATGTC 

CCCTTAGTGGCTTAGTACTATGTAGCTTGCITTCTGCAGTGAACTTCAGACCCTTCTT^ 

AATGGACTTTTTTTTTTTATCGGAAAACAGTCATTCT 

ACrCAATCACATTTTCrCITCATATCATAATCrCTCAACCA^ 

CCTGATCAAATGCCAACAAAAAGTGAGAATGTTAGAATCATGTATTTTTAGAGGTAG 

GATAAAAAAAAAGGGGCAGATATTCCATTTTCCAAAATATGTATGCAGAAAAAATAAGTA 

ACATATGCTCAGGTAACAAGTTAATTTGTTTACTTGTATTTTATGAATT^^ 

GCCCCGTTCCCACGCCCCGCGCCACGTCACAAACTCCACCCCCTCATTATCATATTGGCTTCAAT^ 

AATAAGGTATATTATTGATGATGTTAATTAACATGCATGGATCCATATGCGGTGTGAAATACCGCACA 

GATGCGTAAGGAGAAAATACCGCATCAGGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCG 

GTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGG 

GGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGC 

GTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAGA 

GGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCT 

CCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGG^ 

ATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAAC 

CCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACG 

ACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACA 

GAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCT 

GAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCG 

GTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCT^ 

TTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATC 

AAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATC/^ 

AGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTC 

GTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCC 

CCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCrCCAGATTTATCAGCAATAAACCAGCCA 

GCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTG 

CCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTGCAGCCA 

TGAGATTATCAAAAAGGATCTTCACCTAGATCCITTTCACGTAGAAAGCCAGTCCGCAGAA^ 

TGACCCCGGATGAATGTCAGCTACTGGGCTATCTGGACAAGGGAAAACGCAAGCGCAAAGAGAAAGC 

AGGTAGCTTGCAGTGGGCTTACATGGCGATAGCTAGACTGGGCGGTTTTATGGACAGCAAGCGAACCG 

GAATTGCCAGCTGGGGCGCCCTCTGGTAAGGTTGGGAAGCCCTGCAAAGTAAACTGGATGGCTTTCTT 

GCCGCCAAGGATCTGATGGCGCAGGGGATCAAGCTCTGATCAAGAGACAGGATGAGGATCGTTTCGC 

ATGATTGAACAAGATGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGA 

CTGGGCACAACAGACAATCGGCTGCTCTGATGCCGCCGTGTTCCGGCTGTCAGCGCAGGGGCGCCCGG 
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TTCri-ri'lGTCAAGACCGACCTGTCCGGTGCCCTGAATGAACTGCAAGACGAGGCAGCGCGGCTATCG 

TGGCTGGCCACGACGGGCGTTCCTTGCGCAGCTGTGCTCGACGTTGTCACTGAAGCGGGAAGGGACTG 

GCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCCTGTCATCTCACCTTGCTCCTGCCGAGAAAGTAT 

CCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGATCCGGCTACCTGCCCATTCGACCACCAA 

GCGAAACATCGCATCGAGCGAGCACGTACTCGGATGGAAGCCGGTCTTGTCGATCAGGATGATCTGGA 

CGAAGAGCATCAGGGGCTCGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGAGCATGCCCGACGGC 

GAGGATCTCGTCGTGACCCATGGCGATGCCTGCTTGCCGAATATCATGGTGGAAAATGGCCGCTTTTCT 

GGATTCATCGACTGTGGCCGGCTGGGTGTGGCGGACCGCTATCAGGACATAGCGTTGGCTACCCGTGA 

TATTGCTGAAGAGCTTGGCGGCGAATGGGCTGACCGCTTCCTCGTGCTTTACGGTATCGCCGCTCCC^ 

TTCGCAGCGCATCGCCnrCTATCGCCTTCnTGACGAGTTC^ 

CAGCTCATTTTTTAACCAATAGGCCGAAATCGGCAAAATCCCTTATAAATCAAAAGAA 

TAGGGTTGAGTGTTGTTCCAGTTTGGAACAAGAGTCCACTATTAAAGAACGTGGACTCCAACGTCAAA 

GGGCGAAAAACCGTCTATCAGGGCGATGGCCCACTACGTGAACCATCACCCTAATCAAGTTTTTTGGG 

GTCGAGGTGCCGTAAAGCACTAAATCGGAACCCTAAAGGGAGCCCCCGATTTAGAGCTTGACGGGGA 

AAGCCGGCGAACGTGGCGAGAAAGGAAGGGAAGAAAGCGAAAGGAGCGGGCGCTAGGGCGCTGGCA 

AGTGTAGCGGTCACGCTGCGCGTAACCACCACACCCGCCGCGCTTAATGCGCCGCTACAGGGCGCGTC 

CATTCGCCATTCAGGATCGAATTAATTCTTAATTAA 
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SEQ ID NO:2 Human TM amino acid sequence 



MLGVLVLGALALAGLGFPAPAEPQPGGSQCVEHDCFALYPGPAT 
FLNASQICDGLRGHLMTVRSSVAADVISLLLNGDGGVGRRRLWIGLQLPPGCGDPKR 
LGPLRGFQWVTGDNNTSYSRWARLDLNGAPLCGPLCVAVSAAEATVPSEPIWEEQQ 
CEVKADGFLCEFHFPATCRPLAVEPGAAAAAVSITYGTPFAARGADFQALPVGSSAA 
VAPLGLQLMCTAPPGAVQGHWAREAPGAWDCSVENGGCEHACNAIPGAPRCQCPA 
GAALQADGRSCTASATQSCNDLCEHFCVPNPDQPGSYSCMCETGYRLAADQHRCED 
VDDCILEPSPCPQRCVNTQGGFECHCYPNYDLVDGECVEPVDPCFRANCEYQCQPLN 
QTSYLCVCAEGFAPIPHEPHRCQMFCNQTACPADCDPNTQASCECPEGYILDDGFICT 
DIDECENGGFCSGVCHNLPGTFECICGPDSALARHIGTDCDSGKVDGGDSGSGEPPPS 
PTPGSTLTPPAVGLVHSGLLIGISIASLCLVVALLALLCHLRKKQGAARAKMEYKCAA 
PSKEVVLQHV RTERTPQRL 
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SEQ ID NO:3 human TM nucleotide sequence 

atgcttggg gtcctggtcc ttggcgcgct ggccctggcc ggcctggggt tccccgcacc cgcagagccg cagccgggtg 
gcagccagtg cgtcgagcac gactgcttcg cgctctaccc gggccccgcg accttcctca atgccagtca gatctgcgac 
ggactgcggg gccacctaat gacagtgcgc tcctcggtgg ctgccgatgt catttccttg ctactgaacg gcgacggcgg 
cgttggccgc cggcgcctct ggatcggcct gcagctgcca cccggctgcg gcgaccccaa gcgcctcggg cccctgcgcg 
gcttccagtg ggttacggga gacaacaaca ccagctatag caggtgggca cggctcgacc tcaatggggc tcccctctgc 
ggcccgttgt gcgtcgctgt ctccgctgct gaggccactg tgcccagcga gccgatctgg gaggagcagc agtgcgaagt 
gaaggccgat ggcttcctct gcgagttcca cttcccagcc acctgcaggc cactggctgt ggagcccggc gccgcggctg 
ccgccgtctc gatcacctac ggcaccccgt tcgcggcccg cggagcggac ttccaggcgc tgccggtggg cagctccgcc 
gcggtggctc ccctcggctt acagctaatg tgcaccgcgc cgcccggagc ggtccagggg cactgggcca gggaggcgcc 
gggcgcttgg gactgcagcg tggagaacgg cggctgcgag cacgcgtgca atgcgatccc tggggctccc cgctgccagt 
gcccagccgg cgccgccctg caggcagacg ggcgctcctg caccgcatcc gcgacgcagt cctgcaacga cctctgcgag 
cacttctgcg ttcccaaccc cgaccagccg ggctcctact cgtgcatgtg cgagaccggc taccggctgg cggccgacca 
acaccggtgc gaggacgtgg atgactgcat actggagccc agtccgtgtc cgcagcgctg tgtcaacaca cagggtggct 
tcgagtgcca ctgctaccct aactacgacc tggtggacgg cgagtgtgtg gagcccgtgg acccgtgctt cagagccaac 
tgcgagtacc agtgccagcc cctgaaccaa actagctacc tctgcgtctg cgccgagggc ttcgcgccca ttccccacga 
gccgcacagg tgccagatgt tttgcaacca gactgcctgt ccagccgact gcgaccccaa cacccaggct agctgtgagt 
gccctgaagg ctacatcctg gacgacggtt tcatctgcac ggacatcgac gagtgcgaaa acggcggctt ctgctccggg 
gtgtgccaca acctccccgg taccttcgag tgcatctgcg ggcccgactc ggcccttgcc cgccacattg gcaccgactg 
tgactccggc aaggtggacg gtggcgacag cggctctggc gagcccccgc ccagcccgac gcccggctcc accttgactc 
ctccggccgt ggggctcgtg cattcgggct tgctcatagg catctccatc gcgagcctgt gcctggtggt ggcgcttttg 
gcgctcctct gccacctgcg caagaagcag ggcgccgcca gggccaagat ggagtacaag tgcgcggccc cttccaagga 
ggtagtgctg cagcacgtgc ggaccgagcg gacgccgcag agactc 
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SEQ ID NO: 4 



GTTTAAACGGGCCCTCTAGACGCGTTGACATTGATrATl^GACTAGTl\^lTAATAGTAATCAA^ 

GTCATTAGTTCATAGCCCATGATATCATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCC 

TGGCTGACCGCCrAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCC^^^ 

TAGGGACTTTCCATTGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACT^^ 

GTGTATCATATGCCAAGTACGCCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATG 

CCCAGTNCATGACCTTATGGGACTTTCCTACITGGCAGACATCTACGTAT^ 

GGTGATGCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGACTCACGGGGATT^ 

CTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAATCAACGGGACTTTCCAAAATGTCGT 

AACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCTATATAAGCAGAG 

CTCTCTGGCTAACTAGAGAACCCCTGCTTACTGGCTTATCGAGATATCTGCAGAATTCATCTGTCGACT 

GCTACCGGCAGCGCGCAGCGGCAAGAAGTGTCTGGGCTGGGACGGACAGGAGAGGCTGTCGCCATCG 

GCGTCCrGTGCCCCTCTGCTCCGGCACGGCCCTGTCGCAGTGCCCGCGCTTTCCCCGGCGCCTGCACGC 

GGCGCGCCTGGGTAACATGCTTGGGGTCCTGGTCCTTGGCGCGCTGGCCCTGGCCGGCCTGGGGTTCC 

CCGCACCCGCAGAGCCGCAGCCGGGTGGCAGCCAGTGCGTCGAGCACGACTGCTTCGCGCTCTACCCG 

GGCCCCGCGACCTTCCTCAATGCCAGTCAGATCTGCGACGGACTGCGGGGCCACCTAATGACAGTGCG 

CTCCTCGGTGGCTGCCGATGTCATTTCCTTGCTACTGAACGGCGACGGCGGCGTTGGCCGCCGGCGCCT 

CTGGATCGGCCTGCAGCTGCCACCCGGCTGCGGCGACCCCAAGCGCCTCGGGCCCCTGCGCGGCTTCC 

AGTGGGTTACGGGAGACAACAACACCAGCTATAGCAGGTGGGCACGGCTCGACCTCAATGGGGCTCC 

CCTCTGCGGCCCGTTGTGCGTCGCTGTCTCCGCTGCTGAGGCCACTGTGCCCAGCGAGCCGATCTGGGA 

GGAGCAGCAGTGCGAAGTGAAGGCCGATGGCTTCCTCTGCGAGTTCCACTTCCCAGCCACCTGCAGGC 

CACTGGCTGTGGAGCCCGGCGCCGCGGCTGCCGCCGTCTCGATCACCTACGGCACCCCGTTCGCGGCC 

CGCGGAGCGGACTTCCAGGCGCTGGCGGTGGGCAGCTCCGCCGCGGTGGCTCCCCTCGGCTTACAGCT 

AATGTGCACCGCGCCGCCCGGAGCGGTCCAGGGGCACTGGGCCAGGGAGGCGCCGGGCGCTTGGGAC 

TGCAGCGTGGAGAACGGCGGCTGCGAGCACGCGTGCAATGCGATCCCTGGGGCTCCCCGCTGCCAGTG 

CCCAGCCGGCGCCGCCCTGCAGGCAGACGGGCGCTCCTGCACCGCATCCGCGACGCAGTCCTGCAACG 

ACCTCTGCGAGCACTTCTGCGTTCCCAACCCCGACCAGCCGGGCTCCTACTCGTGCATGTGCGAGACC 

GGCTACCGGCTGGCGGCCGACCAACACCGGTGCGAGGACGTGGATGACTGCATACTGGAGCCCAGTC 

CGTGTCCGCAGCGCTGTGTCAACACACAGGGTGGCTTCGAGTGCCACTGCTACCCTAACTACGACCTG 

GTGGACGGCGAGTGTGTGGAGCCCGTGGACCCGTGCTTCAGAGCCAACTGCGAGTACCAGTGCCAGCC 

CCTGAACCAAACTAGCTACCTCTGCGTCTGCGCCGAGGGCTTCGCGCCCATTCCCCACGAGCCGCACA 

GGTGCCAGATGTTTTGCAACCAGACTGCCTGTCCAGCCGACTGCGACCCCAACACCCAGGCTAGCTGT 

GAGTGCCCTGAAGGCTACATCCTGGACGACGGTTTCATCTGCACGGACATCGACGAGTGCGAAAACGG 

CGGCTTCTGCTCCGGGGTGTGCCACAACCTCCCCGGTACCTTCGAGTGCATCTGCGGGCCCGACTCGGC 

CCTTGCCCGCCACATTGGCACCGACTGTGACTCCGGCAAGGTGGACGGTGGCGACAGCGGCTCTGGCG 

AGCCCCCGCCCAGCCCGACGCCCGGCTCCACCTTGACTCCTCCGGCCGTGGGGCTCGTGCATTCGGGC 

TTGCTCATAGGCATCTCCATCGCGAGCCTGTGCCTGGTGGTGGCGCTTTTGGCGCTCCTCTGCCACCTG 

CGCAAGAAGCAGGGCGCCGCCAGGGCCAAGATGGAGTACAAGTGCGCGGCCCCTTCCAAGGAGGTAG 

TGCTGCAGCACGTGCGGACCGAGCGGACGCCGCAGAGACTCTGAGCGGCCTCCGTCCAGGAGCCTGG 

CTCCGTCCAGGAGCCTGTGCCTCCTCACCCCCAGCTTTGCTACCAAAGCACCTTAGCTGGCATTACAGC 

TGGAGAAGACCCTCCCCGCACCCCCCAAGCTGTTTTCTTCTATTCCATGGCTAACTGGCGAGGGGGTG 

ATTAGAGGGAGGAGAATGAGCCTCGGCCTCTTCCGTGACGTCACTGGACCACTGGGCAATGATGGCAA 

TTTTGTAACGAAGACACAGACTGCGATTTGTCCCAGGTCCTCACTACCGGGCGCAGGAGGGTGAGCGT 

TATTGGTCGGCAGCCTTCTGGGCAGACCTTGACCTCGTGGGCTAGGGATGACTAAAATATTTATTTTTT 

TTAAGTATTTAGGTTTTTGTTTGTTTCCTTTGTTCT^ 

CTCCGGTCTCTCTCTCTCTACAAACTCCCACTTGTCATGTGACAGGTAAACTATCTTGGTGAATTTrriT 

TTCCTAGCCCTCTCACATTTATGAAGCAAGCCCCACTTATTCCCCATTCTTCCTAGTTT^ 

GAACTGGGCCAACTCACCTGAGTCACCCTACCTGTGCCTGACCCTACTTCTTTTGCTCTTAGCTGTCTG 

CTCAGACAGAACCCCTACATGAAACAGAAACAAAAACACTAAAAATAAAAATGGCCATTTGCTTTTTC 

ACCAGATTTGCTAATTTATCCTGAAATTTCAGATTCCCAGAGCAAAATAATT^ 

ATGTAAAAGGTATTAAATTGATGTTGCTGGACTGTCATAGAAATTACACCCAAAGAGGTATTTATCTT^ 
ACTTTTAAACAGTGAGCCTGAATTTTGTTGCTGTTTTGATT^ 
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TCTTCTTATGCAATTTCCTTTTTTGTTATTATTACTTATT^^ 

GCTCTAGATTGAGAGAAGAGACAAACACCTCCCAGGAGACAGTTCAAGAAAGCTTCAAACTGCATGA 

TTCATGCCAATTAGCAATTGACTGTCACTGTTCCTTGTCACTGGTAGACCAAAATAAAACCAGCTCTAC 

TGGTCTTGTGGAATTGGGAGCTTGGGAATGGATCCTGGAGGATGCCCAATTAGGGCCTAGCCTTAATC 

AGGTCCTCAGAGAATTTCTACCATTTCAGAGAGGCCTTTTGGAATGTGGCCCCTGAACAAGAATTGGA 

AGCTGCCCTGCCCATGGGAGCTGGTTAGAAATGCAGAATCCTAGGCTCCACCCCATCCAGTTCATGAG 

AATCTATATTTAACAAGATCTGCAGGGGGTGTGTCTGCTCAGTAATTTGAGGACAACCATTCCAGACT 

GCTTCCAATTTTCTGGAATACATGAAATATAGATCAGTTATAAGTAGCAGGCCAAGTCAGGCCCTTATT 

TTCAAGAAACTGAGGAATTTTCITTGTGTAGCTTTGCTCmTGGT^ 

AGACACTGCCACACAGGGTCTGCAAGGTCTTTGGTTCAGCTAAGCTAGGAATGAAATCCTGC^ 

GTATGGAAATAAATGTATCATAGAAATGTAACTTTTGTAAGACAAAGGTTTTCCTCTTCTATT^ 

ACTCAAAATATTTGTACATAGTTATTTATTTATTGGAGATAATCTAGAACACAGGCAAAAT 

ATGACATCACTTGTACAAAATAAACAAATAACAATGTGAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 

AAAAAAAAAAAAGGTAGCAGTCGACAGATGAATTCCACCACACTGGACTAGTGGATCCGAGCTCGGT 

ACCAAGCTTAAGTTTAAAC 
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SEQ ID NO 5 



TCTAGACGCGTTGACATTGATTAlTGACTAGTTATrAATAGTAATCAATlACGGGGTCATrAGTrCAm 

GCCCATGATATCATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCC 

AACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCA 

TTGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATC^^ 

CAAGTACGCCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTNCATGAC 

CTTATGGGACTTTCCTACTTGGCAGACATCTACGTATTAGTCATCGCTATTACCATGGTGA 

TGGCAGTACATCAATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTTCrAAGTCTC^^^ 

ACGTCAATGGGAGTTTGTTTTGGCACCAAAATCAACGGGACTTTCCAAAATGTCGTAACAACTCCGCC 

CCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCTCTCTGGCTAAC 

TAGAGAACCCCTGCTTACTGGCTTATCGAGATATC 
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* SEQ ID NO 6 



GGCAGCGCGCAGCGGCAAGAAGTGTCTGGGCTGGGACGGACAGGAGAGGCTGTCGCCATCGGCGTCC 

TGTGCCCCTCTGCTCCGGCACGGCCCTGTCGCAGTGCCCGCGCTTTCCCCGGCGCCTGCACGCGGCGCG 

CCTGGGTAACATGCTTGGGGTCCTGGTCCTTGGCGCGCTGGCCCTGGCCGGCCTGGGGTTCCCCGCACC 

CGCAGAGCCGCAGCCGGGTGGCAGCCAGTGCGTCGAGCACGACTGCTTCGCGCTCTACCCGGGCCCCG 

CGACCTTCCTCAATGCCAGTCAGATCTGCGACGGACTGCGGGGCCACCTAATGACAGTGCGCTCCTCG 

GTGGCTGCCGATGTCATTTCCTTGCTACTGAACGGCGACGGCGGCGTTGGCCGCCGGCGCCTCTGGAT 

CGGCCTGCAGCTGCCACCCGGCTGCGGCGACCCCAAGCGCCTCGGGCCCCTGCGCGGCTTCCAGTGGG 

TTACGGGAGACAACAACACCAGCTATAGCAGGTGGGCACGGCrCGACCTCAATGGGGCTCCCCTCTGC 

GGCCCGTTGTGCGTCGCTGTCTCCGCTGCTGAGGCCACTGTGCCCAGCGAGCCGATCTGGGAGGAGCA 

GCAGTGCGAAGTGAAGGCCGATGGCTTCCTCTGCGAGTTCCACTTCCCAGCCACCTGCAGGCCACTGG 

CTGTGGAGCCCGGCGCCGCGGCTGCCGCCGTCTCGATCACCTACGGCACCCCGTTCGCGGCCCGCGGA 

GCGGACTTCCAGGCGCTGCCGGTGGGCAGCTCCGCCGCGGTGGCTCCCCTCGGCTTACAGCTAATGTG 

CACCGCGCCGCCCGGAGCGGTCCAGGGGCACTGGGCCAGGGAGGCGCCGGGCGCTTGGGACTGCAGC 

GTGGAGAACGGCGGCTGCGAGCACGCGTGCAATGCGATCCCTGGGGCTCCCCGCTGCCAGTGCCCAGC 

CGGCGCCGCCCTGCAGGCAGACGGGCGCTCCTGCACCGCATCCGCGACGCAGTCCTGCAACGACCTCT 

GCGAGCACTTCTGCGTTCCCAACCCCGACCAGCCGGGCTCCTACTCGTGCATGTGCGAGACCGGCTAC 

CGGCTGGCGGCCGACCAACACCGGTGCGAGGACGTGGATGACTGCATACTGGAGCCCAGTCCGTGTCC 

GCAGCGCTGTGTCAACACACAGGGTGGCTTCGAGTGCCACTGCTACCCTAACTACGACCTGGTGGACG 

GCGAGTGTGTGGAGCCCGTGGACCCGTGCTTCAGAGCCAACTGCGAGTACCAGTGCCAGCCCCTGAAC 

CAAACTAGCTACCTCTGCGTCTGCGCCGAGGGCTTCGCGCCCATTCCCCACGAGCCGCACAGGTGCCA 

GATGTTTTGCAACCAGACTGCCTGTCCAGCCGACTGCGACCCCAACACCCAGGCTAGCTGTGAGTGCC 

CTGAAGGCTACATCCTGGACGACGGTTTCATCTGCACGGACATCGACGAGTGCGAAAACGGCGGCTTC 

TGCTCCGGGGTGTGCCACAACCTCCCCGGTACCTTCGAGTGCATCTGCGGGCCCGACTCGGCCCTTGCC 

CGCCACATTGGCACCGACTGTGACTCCGGCAAGGTGGACGGTGGCGACAGCGGCTCTGGCGAGCCCCC 

GCCCAGCCCGACGCCCGGCTCCACCTTGACTCCTCCGGCCGTGGGGCTCGTGCATTCGGGCTTGCTCAT 

AGGCATCTCCATCGCGAGCCTGTGCCTGGTGGTGGCGCTTTTGGCGCTCCTCTGCCACCTGCGCAAGAA 

GCAGGGCGCCGCCAGGGCCAAGATGGAGTACAAGTGCGCGGCCCCTTCCAAGGAGGTAGTGCTGCAG 

CACGTGCGGACCGAGCGGACGCCGCAGAGACTCTGAGCGGCCTCCGTCCAGGAGCCTGGCTCCGTCCA 

GGAGCCTGTGCCTCCTCACCCCCAGCTTTGCTACCAAAGCACCTTAGCTGGCATTACAGCTGGAGAAG 

ACCCTCCCCGCACCCCCCAAGCTGTTTTCTTCTATTCCATGGCTAACTGGCGAGGGGGTGATTAGAGGG 

AGGAGAATGAGCCTCGGCCTCTTCCGTGACGTCACTGGACCACTGGGCAATGATGGCAATTTTGTAAC 

GAAGACACAGACTGCGATTTGTCCCAGGTCCTCACTACCGGGCGCAGGAGGGTGAGCGTTATTGGTCG 

GCAGCCnrCTGGGCAGACCTTGACCTCGTGGGCTAGGGATGACTAAAATATTTATTTT^^ 

TAGGTTTTTGTTTGTTTCCTTTGTTCTTACCTGTATGTCTCCAGTATCCACT^ 

CTCTCTCTCTACAAACTCCCACTTGTCATGTGACAGGTAAACTATCTTGGTGAATTTTTTTTTCCTAGCC 

CTCTCACATTTATGAAGCAAGCCCCACTTATTCCCCATTCTTCCTAGTTTTCTCCTCCCAGGAACTGGGC 

CAACTCACCTGAGTCACCCTACCTGTGCCTGACCCTACTTCTTTTGCTCTTAGCTGTCTGCTCAGACAG 

AACCCCTACATGAAACAGAAACAAAAACACTAAAAATAAAAATGGCCATTTGC^^ 

GCTAATTTATCCTGAAATTTCAGATTCCCAGAGCAAAATAATTTTAAACAAAGGTTGAGATGTAAAAG 

GTATTAAATTGATGTTGCTGGACTGTCATAGAAATTACACCCAAAGAGGTATTTATCTTTACT^^ 

CAGTGAGCCTGAATTTTGTTGCTGTTTTGATTTGTACTGAAAAATGGTAAT^ 

CAATTTCCTTTTTTGTTATTATTACTTATTTT^ 

GAGAGAAGAGACAAACACCTCCCAGGAGACAGTTCAAGAAAGCTTCAAACTGCATGA 

TTCATGCCAATTAGCAATTGACTGTCACTGTTCCTTGTCACTGGTAGACCAAAATAAAACCAGCTCTAC 

TGGTCTTGTGGAATTGGGAGCTTGGGAATGGATCCTGGAGGATGCCCAATTAGGGCCTAGCCTTAATC 

AGGTCCTCAGAGAATTTCTACCATTTCAGAGAGGCCTTTTGGAATGTGGCCCCTGAACAAGAATTGGA 

AGCTGCCCTGCCCATGG<3AGCTGGTTAGAAATGCAGAATCCTAGGCTCCACCCCATCCAGTTCATGAG 

AATCTATATTTAACAAGATCTGCAGGGGGTGTGTCTGCTCAGTAATTTGAGGACAACCATTCCAGA 

GCTTCCAATTTTCTGGAATACATGAAATATAGATCAGTTATAAGTAGCAGGCCAAGTCAGGCCCTTATT 

TTCAAGAAACTGAGGAATTTTCTTTGTGTAGCTTTGCTCTTTGGTAGAAAAGGCTAGGTACAC 

AGACACTGCCACACAGGGTCTGCAAGGTCTTTGGTTCAGCTAAGCTAGGAATGAAATCCTGCTTCAGT 
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GTATGGAAATAAATGTATCATAGAAATGTAACTTTTGTAAGACAAAGGTTTTCCTCnTCTATT^ 
ACTCAAAATATTTGTACATAGTTATTTATTTATTGGAGATAATCTAGAACACAGGCAAAATCC^ 
ATGACATCACTTGTACAAAATAAACAAATAACAATGTGAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAA 
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